Document image understanding: geometric and logical layout
Abstract
Document Image Understanding encompasses the technology required to make paper documents equivalent to other computer exchange media such as floppies, tapes, and CD-ROMs. The physical reader of a paper document is the scanner, just as the physical reader of a floppy is the floppy drive, that of a tape cartridge is the tape cartridge drive, and that of a CD-ROM is the CD-ROM drive.
Similar resources
Document image analysis with cooperative interaction between layout analysis and logical structure analysis
When a printed document is to be input to a computer system, the document must be converted to a computer-readable format, e.g., ASCII, PDF, RTF, CSV, or SGML/XML/HTML-tagged data. In order to obtain these data formats from a printed document, it is necessary to extract from the printed document as much information as possible, i.e., layout structure (layout objects and their hierarchical relat...
Geometric Layout Analysis Techniques for Document Image Understanding: a Review
Document Image Understanding (DIU) is an interesting research area with a large variety of challenging applications. Researchers have worked on this topic for decades, as witnessed by the scientific literature. The main purpose of the present report is to describe the current status of DIU, with particular attention to two subprocesses: document skew angle estimation and page decomposition. Sev...
Document Decomposition into Geometric and Logical Layout
We present an Android application for scanning LaTeX documents to determine the logical layout of the document. The algorithm first prepares the image for processing, then determines where text and figures are within a document, and finally classifies these various components of a document.
Logical Layout Recovery: approach for graphic-based features
In contrast to existing approaches to document analysis and understanding, this paper presents a system that considers a logical role for graphic content in predominantly textual, born-digital PDF documents. This work was inspired by the idea of using structural graphic objects to clarify the logical layout even of complex, mostly graphic documents. Based on visual cognition, geom...
Machine Learning for Reading Order Detection in Document Image Understanding
Document image understanding refers to logical and semantic analysis of document images in order to extract information understandable to humans and codify it into machine-readable form. Most of the studies on document image understanding have targeted the specific problem of associating layout components with logical labels, while less attention has been paid to the problem of extracting relat...